AITopics | transfer-based black-box attack

Collaborating Authors

transfer-based black-box attack

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Theory of Transfer-Based Black-Box Attacks: Explanation and Implications

Neural Information Processing SystemsDec-24-2025, 09:37:47 GMT

Transfer-based attacks are a practical method of black-box adversarial attacks, in which the attacker aims to craft adversarial examples from a source (surrogate) model that is transferable to the target model. A wide range of empirical works has tried to explain the transferability of adversarial examples from different angles. However, these works only provide ad hoc explanations without quantitative analyses. The theory behind transfer-based attacks remains a mystery.This paper studies transfer-based attacks under a unified theoretical framework. We propose an explanatory model, called the manifold attack model, that formalizes popular beliefs and explains the existing empirical results. Our model explains why adversarial examples are transferable even when the source model is inaccurate. Moreover, our model implies that the existence of transferable adversarial examples depends on the "curvature" of the data manifold, which quantitatively explains why the success rates of transfer-based attacks are hard to improve. We also discuss the expressive power and the possible extensions of our model in general applications.

explanation and implication, name change, transfer-based black-box attack, (6 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.67)

Technology: Information Technology > Artificial Intelligence (0.64)

Add feedback

A Theory of Transfer-Based Black-Box Attacks: Explanation and Implications (Supplementary Material) Anonymous Author(s) Affiliation Address email

Neural Information Processing SystemsOct-8-2025, 09:05:55 GMT

Our model fulfills criterion 2.

adversarial example, information, proposition 4, (17 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Towards Deep Learning Models Resistant to Transfer-based Adversarial Attacks via Data-centric Robust Learning

Yang, Yulong, Lin, Chenhao, Ji, Xiang, Tian, Qiwei, Li, Qian, Yang, Hongshan, Wang, Zhibo, Shen, Chao

arXiv.org Artificial IntelligenceOct-15-2023

Transfer-based adversarial attacks raise a severe threat to real-world deep learning systems since they do not require access to target models. Adversarial training (AT), which is recognized as the strongest defense against white-box attacks, has also guaranteed high robustness to (black-box) transfer-based attacks. However, AT suffers from heavy computational overhead since it optimizes the adversarial examples during the whole training process. In this paper, we demonstrate that such heavy optimization is not necessary for AT against transfer-based attacks. Instead, a one-shot adversarial augmentation prior to training is sufficient, and we name this new defense paradigm Data-centric Robust Learning (DRL). Our experimental results show that DRL outperforms widely-used AT techniques (e.g., PGD-AT, TRADES, EAT, and FAT) in terms of black-box robustness and even surpasses the top-1 defense on RobustBench when combined with diverse data augmentations and loss regularizations. We also identify other benefits of DRL, for instance, the model generalization capability and robust fairness.

accuracy, arxiv preprint arxiv, robustness, (13 more...)

arXiv.org Artificial Intelligence

2310.09891

Country:

North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
Europe > Czechia > Prague (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback